Multiple pronunciation model for Amharic speech recognition system

نویسنده

  • Solomon Gizaw
چکیده

In this paper the research have tried to show the pattern variations of sound units in Amharic language for multiple pronunciation model. This are variation of sound units at lexical level due to dialects. After that an attempt to build a pronunciation dictionary for Automatic Speech Recognition (ASR).At last comments and recommendations are included. Amharic is an official language of Ethiopia. It is a Semitic language that has the greatest number of speakers after Arabic. Amharic has five dialectical variations spoken named as: Addis Ababa, Gojam, Gonder ,Wollo and Menz[1]. The Amharic writing system uses multitudes of ways to denote compound words and there is no agreed upon spelling standard for compounds. As a result of this and of the size of the country leading to vast dialectal dispersion, lexical variation and homophony is very common [2]. Pronunciation variation is a phenomenon observed within a speaker or within a group of speakers of the same dialect or among speakers across dialects of the same language. Pronunciation variation deals with the different ways of speaking a given word. Pronunciation variation modeling has been studied in the field of speech synthesis and recognition to improve performance of the corresponding speech systems [3]. The Amharic orthography as it is represented in the Amharic character set consists of 276 distinct symbols. These symbols are classified into four groups. In the first category (33*7=231) there are thirtythree core orthographic symbols, each of which has seven different shapes, usually known as orders, to represent the seven vowels. Each consonant and the seven vowels in combination represent CV syllables[4]. Each of these consonant and vowel grapheme can appear independently or can form a combinant letter. Each consonant can form CV pattern except with the vowel /ix/ (called epenthesis vowel)[5] . The second category (4*5=20) consists of four labio-velar symbols, which have five orders. The eighteen labelized consonant, which have only one order, are the third category. The fourth category is the representation of numbers from 1 to 10 and multiples of 10 each with different symbols [4]. The Amharic language script is called Ethiopic. Even though the vowel modification is not entirely semantic, the Ethiopic script is a syllabic structure [5]. The first International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU 2008) The first International Workshop on Spoken Languages Technologies for Under-resourced languages (SLTU 2008)

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Grapheme-to-Phoneme Conversion for Amharic Text-to-Speech System

Developing correct Grapheme-to-Phoneme (GTP) conversion method is a central problem in text-tospeech synthesis. Particularly, deriving phonological features which are not shown in orthography is challenging. In the Amharic language, geminates and epenthetic vowels are very crucial for proper pronunciation but neither is shown in orthography. This paper describes an architecture, a preprocessing...

متن کامل

Experimental detection of vowel pronunciation variants in Amharic

The pronunciation lexicon is a fundamental element in an automatic speech transcription system. It associates each lexical entry (usually a grapheme), with one or more phonemic or phone-like forms, the pronunciation variants. Thorough knowledge of the target language is a priori necessary to establish the pronunciation baseforms and variants. The reliance on human expertise can pose difficultie...

متن کامل

Syllable-Based Speech Recognition for Amharic

Amharic is the Semitic language that has the second large number of speakers after Arabic (Hayward and Richard 1999). Its writing system is syllabic with Consonant-Vowel (CV) syllable structure. Amharic orthography has more or less a one to one correspondence with syllabic sounds. We have used this feature of Amharic to develop a CV syllable-based speech recognizer, using Hidden Markov Modeling...

متن کامل

A speaker independent continuous speech recognizer for Amharic

The paper discusses an Amharic speaker independent continuous speech recognizer based on an HMM/ANN hybrid approach. The model was constructed at a context dependent phone part sub-word level with the help of the CSLU Toolkit. A promising result of 74.28% word and 39.70% sentence recognition rate was achieved. These are the best figures reported so far for speech recognition for the Amharic lan...

متن کامل

Automatic speech recognition for an under-resourced language - amharic

In this paper we present the development of an Automatic Speech Recognition System (ASRS) for Amharic using limited available resources and the freely available speech toolkit (HTK). There are phonological, dialectal, orthographic and morphological features of Amharic that challenge the development of ASRSs. The problem of resource scarcity is also a hindrance to the research and development in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008